Nonparametric Bayesian Two-Level Clustering for Subject-Level Single-Cell Expression Data

نویسندگان

چکیده

The advent of single-cell sequencing opens new avenues for personalized treatment. In this paper, we address a two-level clustering problem simultaneous subject subgroup discovery (subject level) and cell type detection (cell expression data from multiple subjects. However, current statistical approaches either cluster cells without considering the heterogeneity or group subjects using information. To bridge gap between grouping, develop nonparametric Bayesian model, Subject Cell Single-Cell (SCSC) to achieve grouping simultaneously. SCSC does not need prespecify number number. It automatically induces structures matches types across Moreover, it directly models raw count by deliberately data's dropouts, library sizes, over-dispersion. A blocked Gibbs sampler is proposed posterior inference. Simulation studies application multi-subject iPSC scRNA-seq dataset validate ability simultaneously cells.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Bayesian Nonparametric Multilevel Clustering with Group-Level Contexts

We present a Bayesian nonparametric framework for multilevel clustering which utilizes grouplevel context information to simultaneously discover low-dimensional structures of the group contents and partitions groups into clusters. Using the Dirichlet process as the building block, our model constructs a product base-measure with a nested structure to accommodate content and context observations...

متن کامل

Nonparametric Bayesian Clustering Ensembles

Forming consensus clusters from multiple input clusterings can improve accuracy and robustness. Current clustering ensemble methods require specifying the number of consensus clusters. A poor choice can lead to under or over fitting. This paper proposes a nonparametric Bayesian clustering ensemble (NBCE) method, which can discover the number of clusters in the consensus clustering. Three infere...

متن کامل

A Spatiotemporal Nonparametric Bayesian Model of Multi - Subject Fmri Data

In this paper we propose a unified, probabilistically coherent framework for the analysis of task-related brain activity in multi-subject fMRI experiments. This is distinct from two-stage “group analysis” approaches traditionally considered in the fMRI literature, which separate the inference on the individual fMRI time courses from the inference at the population level. In our modeling approac...

متن کامل

Discriminative Bayesian Nonparametric Clustering

We propose a general framework for discriminative Bayesian nonparametric clustering to promote the inter-discrimination among the learned clusters in a fully Bayesian nonparametric (BNP) manner. Our method combines existing BNP clustering and discriminative models by enforcing latent cluster indices to be consistent with the predicted labels resulted from probabilistic discriminative model. Thi...

متن کامل

Automated Two Level Variable for Multiview Clustering

Clustering is used to identify the relationship among different objects from large volume of data. The clustering analysis is feasible only when the groups are formed with important features. The existing K-Means clustering processing time and the computation cost is high. The proposed two level variable weighting algorithm calculates weights for both views and variables to identify the importa...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Statistica Sinica

سال: 2023

ISSN: ['1017-0405', '1996-8507']

DOI: https://doi.org/10.5705/ss.202020.0337